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Tone Reproduction for Realistic Images 
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Display methods should 
compensate for all light- 
dependent changes in the 
way we see. Better tone 
reproduction for computer 
graphics will help solve display 
range problems. 



gflp adiosity and other global illumination methods for image 
Brasynthesis calculate the "real world" radiance values of a 
scene instead of the display radiance values that will represent 
them. Though radiosity and ray tracing methods can compute 
extremely accurate and wide-ranging scene radiances, modern 
display devices emit light only in a tiny fixed range. The radi- 
ances must be converted, but ad-hoc conversions cause serious 
errors and give little assurance that the evoked visual sensa- 
tions are truly equivalent. Common methods of conversion can 
fail spectacularly for extreme lighting conditions because they 
ignore light-dependent changes in the way we see. 

These conversions deserve attention because better ones are 
easily implemented by computer. Sensation -preserving con- 
versions for display are already known in photography, printing, 
and television as tone reproduction methods, Computer graph- 
ics workers can apply the existing photographic methods, but 
may also extend them to include more complex and subtle ef- 
fects of human vision using the published findings of vision re- 
searchers. We will demonstrate how to construct a 
sensation-preserving display converter, or tone reproduction 
operator, for monochrome images. See the sidebar for an ex- 
planation of why tone reproduction matters. 



Radiance values appear to be exactly as we see them, but hu- 
mans are very poor judges of absolute radiance. ? The eye's sen- 
sitivity to light varies with wavelength, even within the visible 
band. More importantly, the visual system is far better at sens- 
ing spatial and temporal changes in radiance than its absolute 
value. Humans apparently form perceptions of light strength 
from judgments of these changes. Good tone reproduction 
needs at least three different measures of light: the absolute 
magnitude of light energy in the visible band, the magnitude of 
light sensible to the human eye, and the perceived strength of 
the light. We use radiance, luminance, and brightness respec- 
tively for these three measures. 

Radiance is the quantity of electromagnetic energy per unit 
projected area, time, and solid angle. The spectral radiance is the 
radiance per unit wavelength at a particular wavelength. In gen- 
eral, global illumination methods such as radiosity or path trac- 
ing compute spectral radiance. Radiance can be measured with 
a radiometer. 

Luminance is the physical measure of light sensible by a "stan- . 
dard" human eye. The luminance is obtained by integrating the 
spectral radiance weighted by the eye's luminous efficiency 
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Tone Reproduction 



Why t @ne reproduction matters 



Many computer graphics image synthesis algorithms can't 
tell the difference between night and day — differences that 
are obvious to any human eye. Figure A depicts a room of 
uniformly diffuse surfaces lit by a single extended light 
source with adjustable emitted power. Suppose we choose 
two settings, one at an emitted power equal to that of two 
squashed fireflies and another with the power of an anti- 
aircraft searchlight The global illumination solution is linear 
in source radiance; that is, results for any two light source 
strengths are directly proportional. Accordingly, the image 
of the room with firefly lighting is identical to the image of 
the room lit by a searchlight, except for a scale factor of 
about 10". 

How should such firefly- and searchlight-strength room 
images be displayed? One widely used method normalizes 
all computed scene radiances by the value of the strongest 
nonemitting surface in the image, then these normalized 
values drive a gamma-corrected display device. This method, 
used for Figure A, gives the appearance of pleasant office 
lighting. This ad-hoc normalization removes the scale factor 
that distinguishes the firefly- and searchlight-powered room 
images; thus, Figure A is the displayed result for both of 
them, absurd as it seems. 

Human observers see these rooms quite differently. To a 
dark-adapted viewer, the ftrefly-powered room appears as 
little more than a very dim area, while an observer of the 
searchlight-illuminated room might see only harsh shadows 
against glaring white. 
Neither observer would 
recognize Figure A — it is 
wholly inaccurate. Tone 
reproduction operators 
correct these deficiencies. 

Figure A is also a 
dubious result for 
moderate illumination. As 
discussed in Meyer et af., 1 

the rendering process has 

two steps: the calculation 

of scene radiances, and 

the use of principles of 

perception to map these 

radiances to the display 

device. To be treated 

scientifically, the rendering 

process must be subject to 

verification at each stage. 

A tremendous amount of 

recent research deals with 

rigorous, first principle 

methods for the first step. 

Researchers have studied 

color transformations in. 

the second step as well. 2 



However, the mapping of real world radiances to the display 
device (tone reproduction) has been generally ignored, 

Fine image details in Figure A were very expensive to 
compute, yet the human observers in the firefly- and 
searchlight-illuminated rooms cannot see them. A good 
tone reproduction operator can predict visibility thresholds 
for errors in each region of the displayed image and avoid 
these wasted efforts. By setting the precision of the 
computed radrosity solution, tone reproduction addresses 
one of the basic questions remaining in global 
illumination — how accurate does a solution have to be? 

Figure A lacks the strongly light-dependent effects 
characteristic of human vision needed for good tone 
reproduction. The subjective accuracy of the displayed 
image can be improved by including models of the complex, 
dramatic changes to human vision that occur over the 
firefly-to-searchlight range. People have strong differences 
in sensitivity, acuity, contrast perception, and color 
sensitivity; high contrast effects such as glare, dazzle, 
afterimages, color washout, and diffraction; spatial effects 
such as Mach banding and hyperacuity; and temporal 
effects such as changes in adaptation, thermal noise, and 
motion blur. All of these effects contribute to realistic 
images, but useful results are demonstrated here using just 
a. few of them. We wish to stress the importance of explicit 
tone reproduction operators and encourage their use, rather 
than to champion any particular expression. 



Figure A* Display of radiopity solution using ad-hoc scale factor. 
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Figure 1. The tone reproduction problem: What operator will! t 



i a close match besween real-world and display brightness sensations? 



curve over the entire spectrum. Luminance can be measured by 
a photometer. 

Brightness is the magnitude of the subjective sensation pro- ' 
duced by light. Brightness cannot be measured by a meter and 
is known only by indirect psychophysical experiments. Here 
we use ^brils," a linear scale of absolute brightness described by 
Stevens," where 1 bril equals the sensation induced in a fully 
dark-adapted eye by a one-second exposure to a five-degree 
white target of one microlarnbert (0.0314 cd/m 2 ) luminance on 
a completely black background. Some workers disagree with 
Stevens' generalization of his results, but his extensive supra- 
threshold vision experiments provide convenient data for a sim- 
ple tone reproduction operator. 

Tone reproduction is only necessary because the eye's input 
range dwarfs the luminous output range of existing displays. 
Thus, direct reproduction is almost never possible. For example, 
most direct-view CRTs have a luminance range between I to 100 
cd/m 2 s but shadows in a starlit forest are just barely visible at 
about 1 0-* cd/m 2 , while a glinting snow ba nk might emit 10 5 cd/m 2 . 

In this article, we consider tone reproduction for gray-scale 
images only. For a gray environment, the spectral radiance is 
uniform for all wavelengths, and the luminance is just a constant 
times the uniform spectral radiance. Each input value to a gray 
dispJay device causes a unique output luminance, unlike a color 
display in which two different RGB triplets may have the same 
luminance. For gray images, then, a method for accurate tone 
reproduction is completely defined by describing how the sim- 
ulated real-world luminance is mapped to the luminance on the 
display device. 



A ggesteraf 1 

We have built a general framework to define tone reproduc- 
tion from the response of two observer models and a display sys- 
tem model, as shown in Figure t. An observer model is a 
mathematical model of the human visual system that includes 
all desired light-dependent visual effects while converting real- 
world luminance images to perceived brightness images. We 
denote luminance with L, and brightness with B. M ReaJ world'* 
values are given a subscript "rw," while "display' 1 values are 
given a subscript "d " 

We use two observer models. The real world observer views 



Ibe desired luminance image and corresponds to a human visi- 
tor to the room in Figure A, while the display observer views the 
luminance values of the display device. The display model con- 
verts display input values n to viewed luminance values L a . it in- 
cludes effects of ambient room light and CRT performance. 
The tone reproduction operator converts real-world luminances 
to display input values, where « is chosen to closely match 
the two observer model outputs B rw and 

Expanding Figure 1 as shown in Figure 2 reveals the tone re- 
production operator: It is the concatenation of the real-world 
observer, the inverse of the display observer, and the inverse of 
the display model. If each is known and robust, then a tone re- 
production operator follows easily. 



Television and film systems have roughly sigmoid responses 
to light when plotted on log/log axes, as in Figure 3, and both use 
similar equations and nomenclature. At the camera, each trans- 
lates real-world luminances l w to display luminances L a in two 
steps: First the real world luminances are encoded (as trans- 
parency T for film and voltage V for television), then decoded 
to display luminances L d . Combined encoding and decoding 
equations describe each system's tone reproduction. Hie cen- 
ter region of Figure 3 is approximated as a line segment with 
three parameters: "latitude" is the segment's width. T is its 
horizontal offset or film speed, and^gamma" (y) is its slope. 
These three parameters and "£> log £" or "II~D" plots {named 
for Hurter and Driffield, who devised them in 1890), similar to 
Figure 3, are widely used to describe film performance because 
all are chemically or optically controlled. 5 

Film encodes real-world luminances within its latitude as 
transparency values according to 



(1) 



where 

T= film transparency (0 for perfect opacity, 1 for perfect 
transparency) 

a, - a constant factor depending on the film speed and on 

choice of exposure lime, lens, and camera aperture 
Suppose the film is viewed as a transparency placed on a light 
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Figure 2. A sianple solution: Make a tone reproduction operator by concatenating a real-worhl «foseirv«r model, an inverse 
and ass inverse display device model. 



display observer model, 



table that has luminance L s . The luminance is decoded from 
the film according to 



l<*=L h T 



m 



Thus film converts real-world luminance: £^ to displayed lumi- 
nance L. as 



(3) 



Since a, and y are set by the photographer's choice of film, lenses, 
exposure time, aperture, and darkroom processes, human judg- 
ments will strongly affect the resulting image. 

Television system response also resembles Figure 3. Over 
the camera's latitude, real-world luminances L w are encoded to 
V, then decoded to display luminances as 



(4) 



(5) 



where 

V— normalized video signal, with 0 < V < 1. 

a 2 = constant set by camera sensitivity, lens, and aperture. 

Y«<n = "gamma correction" to compensate for excess receiver 

CRT gamma and to reduce visibility of transmission noise 

at the receiver. Typically equal to (standardized for 

television CRTs). 
y m = the standardized television CRT gamma, in the range 

2.8 to 3.0. 

a 3 ~ maximum CRT luminance; typically 48 < a^< 127 cd/rn-. b 

Combined encoding and decoding yields Equation 6, which 
is identical to Equation 3 if constants a 2 > a 7 and y cam , y ai are 
combined: - 



£ d — a;, (a 2 Z^jifciMiTm 



<6) 



MeasBaara observer effects in 
film and f etavlsfon 

Linear reproduction, where a's are nominal and y's equal 1 .0, 
are poor choices for good tone reproduction. The best choices 



Figure X Hurler and Driffield's "characteristic curve" for 

photographic film. 



"flare" 



"fog" 



D=log lo (1/T) 



fstope ^ y 



"knee" region 



"latitude" 



"shoulder" region 



depend heavily on the strength of the displays and their sur- 
rounding room lighting: They vary according to light-depen- 
dent changes tn the way we see. Subjective image quality 
experiments for film 5 and television 7 show ideal y or y ram y cu val- 
ues fall between about 1.1 and L5. Television viewers prefer de- 
creasing y^ m y crt as surroundings brighten, with L5 preferred for 
darkness, 1.2 for dim light (14 cd/m z ), and approaching 1 .0 for 
bright light. Similarly, reflection prints (film mounted on white 
paper backing) with y - 1,0 are overwhelmingly rejected as too 
dark and lacking in contrast. However, when viewed as a 
strongly backlit transparency in a dimly lit room, the same film 
is preferred over any other reflection print. 

C&sttgronenrls of a 
toift@-repr&ducfion operator 

Simply applying film tone-reproduction operators to syn- 
thetic images might be inadequate. Film and television repro~ 
duction is primarily concerned with the light-dependent effects 
on the display observer alone; real-world observer effects are 
left to the judgment of the camera operator, who may change 
exposure settings or even adjust scene radiances to improve 
the displayed result For example, the practice of "day for night 
shooting^ uses optical filters and high-contrast, low-angle light- 
ing to achieve the appearance of night while filming motion 
pictures in daylight. However, a practical operator for accurate 
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Figure 4. Stevens' 2960 es^erimesaft; Measared brsghtaess versiBs 

i at vairifljtis adaptation levels. 



computer graphics must be judgment-free, since images often 
represent real -world scenes never viewed before. We build such 
an operator by finding robust display and observer models. 

One way to build an observer model is to approximate the 
eye's performance with the same sort of power-law relations 
used for film. However, the eye adjusts to its surroundings, so 
the sensitivity and contrast compression constants a t and 7 must 
be made to change with the viewed luminances Z, ifl ; name these 
new functions and aiL^)* respectively. The power-Jaw re- 

lationship between brightness B and L in is then 



(7) 



Equation 7 applies to both the real-world observer (L^ = L n > 
and the display observer (L in — L d ), This simple observer model 
is not new: It is used implicitly in Jones's graphical method for 
photographic tone reproduction, as discussed by James/ and 
agrees with the "power law" models of human neural response 
advocated by Stevens. 8 We will derive lO^i^ and a(L in ) from 
their data. 

Stevens and Stevens 3 attempted to measure the entire gamut 
of the human brightness versus luminance response. In their 
experiments, an observer's eye was allowed to adapt thoroughly 
to a uniform white background luminance, Then they briefly 
presented a small gray target with luminance L,^ against this 
background and assessed brightness. They measured sensations 
from "black" to "white" over most of the usable range of the 
eye. (Brightness values beyond this measured range might be 
analogous to the "knee" and "shoulder" regions of Figure 3. 
For simplicity's sake, we ignored these regions, even though 
this caused ''clipping" in the results shown in Figure 5 .) Figure 
4 presents their results, Luminances were expressed in decibel 
units (dB), where 0 dB is defined as 10" 10 lamberts (3,18 x ID" 7 
cd/m 2 ). Stevens 1 background luminance S (in dB) is converted 
to L w in lamberts by 



5 = 100+ 10 loguXA,) 



<&0 



Stevens denoted the ratio between the target luminance JL^ 
in lamberts and the background, adapting luminance L v as tar 
get reflectance R in dB: 



tf = l»]og tD (^ w / 



(8b) 



Using S and Stevens summarized the data in Figure 4 by 
Equation 8c, in a fonn that reveals the "brightness constancy" 
contour at R = SA dB, where target brightness appears constant 
over a wide range of background luminance: 

l °Kio(#) = 0.004[(J -27)(R.4- Ry- 108j (Be) 

Since Equation 8 describes line segments on log/log axes, it 
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can also be writien in the form of Equation 7, with luminance 
expressed in lamberts: 



<x(L ia ) = 0.4 log l0 (LJ + 2.92 



(9) 



${LJ = -a4(]og m (X w )) 2 + (-2.584 lo gK ,(Z, w )) + 2.0208 (10) 

To complete the observer model, we need a workable value 
of L v from the input luminances Z- in : What uniform white back- 
ground luminance would cause an amount of adaptation equiv- 
alent to that of the complicated image L iti ? James 5 noted that 
several workers used average image luminance as an adaptation 
measure. We chose instead to use the assumption that the eye 
adapts in an attempt to keep most brightnesses near the "bright- 
ness constancy" contour of 8.4 dB below in Figure 4, so 



log^J-^flog^^iJJ + O.^ 
where F{log 10 (L in )J is the expected value of log^I^). 



<U) 



The eye 's response to light is still not well understood. It is dif- 
ficult to quantify because vision blends smoothly with higher 
brain functions and because the eye's behavior is strongly de- 
pendent on the content of the viewed image. Brightness re- 
sponse is usually described by several processes, including at 
least adaptation, simultaneous contrast, brightness- and color- 
constancy, memory, and cognitive processes. Many of these are 
interdependent, self-adjusting, and difficult to measure sepa- 
rately. Each tends to obscure the others, so brightness rules in- 
ferred from simple tests often fail when applied to more 
complex images. 10 Since brightness can only be measured indi- 
rectly, some researchers such as Schrieber 11 suggest that no 
quantitative model is above suspicion. We chose Stevens 1 model 
for its mathematical convenience. 

A siBuple inv&rs© CUT display gB3&sl@l 

We can build a display model from the television- decoding 
equation (Equation 5) with an additive term for the ambient 
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Tons Reproduction 




Figure 3. Nfiw tane-repraductiois 
wjpeiator applied do radirait 5 
sultitirats of Figure A. 



lighting falling on the CRT face. This ambient light also places 
an upper bound on the dispiay's available contrast; 

contrast = (LJL 6 mait ) = + (BG/I d 1IH J 

where 

L dnwx = maximum possible screen luminance (typically equal 
to86cd/m*X 

Y d = approximately 2.9 for uncorrected CRTs, or 1,2 for dis- 
plays with television-standard gamma correction. 

n = frame buffer value (0<R~G = B = n<\) used as display 
device input. 

BG = screen background luminance in cd/m 2 , defined as am- 
bient luminance multiplied by screen reflectance plus CRT 
secondary internal reflections. 

L d — display luminance in cd/m 2 . 

Let the maximum contrast ratio possible between on-screen 
luminances be denoted as C maK . Now approximate the ratio 
(BG/L d max ) as (l/C' maK ). A typical value of C mgiX for direct T view 
CRTs is about 35, 6 We can form the inverse display system 
model that gives the display device input for the required dis- 
play luminance L d simply by solving for n: 



(12) 



A complete 
toiie'reprodHcf Ion operator 

Now all the pieces are in place to construct a tone reproduc- 
tion operator as shown in Figure 2. First, change variable names 
in Equation 7 to create real-world and display observer 
equations; 

B d ~l&*L a «* 



The two observers are connected "back- to-back" as in Figure 
2 by setting 8^ = B d . We can then solve for display luminance 
in terms of real -world luminance: 



(13) 



Note that ot d and p d are functions of the adapting luminance 
of the display, L w(<n , This cannot be computed exactly from 
Equation 11 without knowing L d . To get around this "chicken 
and egg" problem, we approximate L w ^ as constant. Since the 
total range of CRT output is small, changes in L d have little ef- 
fect on ota and p d . Assume display luminances are uniformly 
distributed on a logarithmic scale, with log l0 (iL wid ^) as its mid- 
point. Then Equation 13 can be completed using 



/ *^*d max 



(14) 



Finally, Equation 13 is substituted directly into the inverse 
display system model of Equation 12 to form a complete tone 
reproduction operator: 



L, 




<15) 



Figure 5 shows the results of this operator when applied to the 
radiosi ty solutions of Figure A (repeated in the lower left cor- 
ner for comparison). Please note that these images were en- 
tirely the result of our new tone reproduction operator 
expressed by Equation 15 —no "tweaks" were used for display. 
This hands-off, judgment-free property alone justifies tone re- 
production operators. Output values n are clipped to 1,0 and 
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drive a CRT display where y c ,< - 1.2, as in Equation 5, and real- 
world luminances cover the searchlight-to-firefly range. The 
brightest image resulLs from overhead lamp luminance al 1,000 
lamberts (3.18 x 10 6 cd/m 2 ), and the lamp luminance is reduced 
by a factor of 100 with each successive image. The gray-scale 
strip shows steps of equal «. 

The differences in these images show that this simple tone re- 
production operator acts as both an exposure control and a con- 
trast compressor. The tone operator is notably lacking in spatial 
effects. The very dim images should also be blurry, as Lhe eye's 
resolution fades with decreasing light. The brilliant images 
should be more harsh, but glare and diffraction effects are also 
missing. The operator is also flawed for very dim images; far 
S< 27 dB (Equation 11), the slope of the R-B line in Figure 4 is 
negative. We made no attempt to model the limits of latitude in 
the observer models or display, so clipping occurs in the brighter 
images. However, even this simple operator appears to be a 
consistent and plausible solution to the display range problem. 

Faster^ utrark 

We have shown by example that good tone reproduction does 
not require ad-hoc methods and subjective judgements. We 
achieved plausible "hands-off" results with a small amount of 
computation. More sophisticated observer models should in- 
crease the accuracy of the displayed image by including more 
light -de pen dent effects of vision, especially extensions to color 
and spatial filtering. Even without noticeable improvements in 
brightness reproduction, these methods remove some guess- 
work from r adiosity image display and demonstrate that tone re- 
production operators need not be constrained to the same 
corrections used for film. Because any tone reproduction oper- 
ator explicitly maps the results of a global illumination method 
to the display device, it also specifies the accuracy required in the 
illumination calculations themselves. Since computational costs 
rise rapidly as error tolerance falls, researchers involved in cur- 
rent efforts to develop efficient, realistic, image-synthesis systems, 
need to explore tone reproduction operators. O 
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